Mixture Modeling by Affinity Propagation
Abstract
Clustering is a fundamental problem in machine learning and has been approached in many ways. Two general and quite different approaches are iteratively fitting a mixture model (e.g., using EM) and linking together pairs of training cases that have high affinity (e.g., using spectral methods). Pairwise clustering algorithms need not compute sufficient statistics, and they avoid poor solutions by directly placing similar examples in the same cluster. However, many applications require that each cluster of data be accurately described by a prototype or model, so affinity-based clustering, and its benefits, cannot be directly realized. We describe a technique called “affinity propagation”, which combines the advantages of both approaches. The method learns a mixture model of the data by recursively propagating affinity messages. We demonstrate affinity propagation on the problems of clustering image patches for image segmentation and learning mixtures of gene expression models from microarray data. We find that affinity propagation obtains better solutions than mixtures of Gaussians, the K-medoids algorithm, spectral clustering and hierarchical clustering, and that it can either find a pre-specified number of clusters or determine the number of clusters automatically. Interestingly, affinity propagation can be viewed as belief propagation in a graphical model that accounts for pairwise training case likelihood functions and the identification of cluster centers.
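To make the idea of "recursively propagating affinity messages" concrete, here is a minimal sketch in Python. It is an assumption, not the method of this paper: it uses the commonly seen exemplar-based form of affinity propagation, in which points exchange "responsibility" and "availability" messages, and it may differ from the mixture-model formulation summarized above. The function names, the negative squared distance similarity, the damping factor, the iteration count, and the preference value are all illustrative choices.

```python
def similarity_matrix(points, preference):
    """Hypothetical helper: negative squared distance between 1-D points.

    The diagonal holds the 'preference', i.e. how willing each point is
    to serve as a cluster center (exemplar).
    """
    n = len(points)
    S = [[-(points[i] - points[j]) ** 2 for j in range(n)] for i in range(n)]
    for k in range(n):
        S[k][k] = preference
    return S


def affinity_propagation(S, damping=0.5, iterations=200):
    """Return, for each point, the index of the exemplar it selects.

    S is an n x n similarity matrix (list of lists) with n >= 2.
    """
    n = len(S)
    R = [[0.0] * n for _ in range(n)]  # responsibilities r(i, k)
    A = [[0.0] * n for _ in range(n)]  # availabilities a(i, k)

    for _ in range(iterations):
        # r(i, k) = s(i, k) - max over k' != k of (a(i, k') + s(i, k'))
        for i in range(n):
            for k in range(n):
                best_other = max(A[i][j] + S[i][j] for j in range(n) if j != k)
                new = S[i][k] - best_other
                R[i][k] = damping * R[i][k] + (1 - damping) * new

        # a(i, k) = min(0, r(k, k) + sum of positive r(i', k), i' not in {i, k})
        # a(k, k) = sum of positive r(i', k), i' != k
        for k in range(n):
            pos = [max(0.0, R[j][k]) for j in range(n)]
            total = sum(pos) - pos[k]  # positive responsibilities from i' != k
            for i in range(n):
                if i == k:
                    new = total
                else:
                    new = min(0.0, R[k][k] + total - pos[i])
                A[i][k] = damping * A[i][k] + (1 - damping) * new

    # Each point picks the candidate with the largest combined evidence.
    return [max(range(n), key=lambda k: A[i][k] + R[i][k]) for i in range(n)]


# Two well-separated groups of 1-D points (made-up data for illustration).
points = [0.0, 1.0, 2.0, 100.0, 101.0, 102.0]
labels = affinity_propagation(similarity_matrix(points, preference=-1000.0))
print(labels)
```

In this sketch the number of clusters is not given directly; it is governed by the preference on the diagonal, with lower values tending to yield fewer exemplars. The triple loop is cubic in the number of points and is meant only for reading, not for large data sets.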
Similar resources
Over-Segmentation Based Background Modeling and Foreground Detection with Shadow Removal by Using Hierarchical MRFs
In this paper, we propose a novel over-segmentation based method for the detection of foreground objects from a surveillance video by integrating techniques of background modeling and Markov Random Fields classification. Firstly, we introduce a fast affinity propagation clustering algorithm to produce the over-segmentation of a reference image by taking into account color difference and spatial...
A New Knowledge-Based System for Diagnosis of Breast Cancer by a combination of the Affinity Propagation and Firefly Algorithms
Breast cancer has become a widespread disease among young women around the world. Expert systems developed with data mining techniques are valuable tools in the diagnosis of breast cancer and can help physicians in the decision-making process. This paper presents a new hybrid data mining approach to classify two groups of breast cancer patients (malignant and benign). The proposed approach, AP-AMBFA, con...
Phytochemical Variations in Lemon Verbena (Lippia citriodora H.B.K.) Plantlets Affected by Propagation Methods and Soil Type
Background: Lemon verbena (Lippia citriodora H.B.K.) is an aromatic and medicinal plant of the family Verbenaceae, which is cultivated in the northern region of Iran. Objective: Evaluation of phytochemical characters in Lippia citriodora H.B.K. plantlets affected by propagation methods (micro-propagation and stem cutting) and cultivated in different soil types (peat moss and mixture soil). Methods: This study w...
Modeling of Methane Hydrate Decomposition by Using Chemical Affinity
In this work, experimental kinetics data of methane hydrate decomposition at temperatures ranging from 272.15 to 276.15 K and at pressures ranging from 10 to 30 bars were modeled by using chemical affinity. This model proposed a macroscopic model which is independent of any intermediate mechanism like heat or mass transfer. The results show there is good agreement with experimental data. Al...
Effects of Mixture Inhomogeneity on the Auto-ignition of Reactants under HCCI Environment
As an attempt at providing insight to develop better modeling strategies for HCCI engines, the ignition and propagation of a reaction front in a premixed fuel/air stream mixed with hotter exhaust gases is computationally investigated using the opposed-flow configuration. The effects of heat and radical transport are studied by imposing various mixing rates on the system. The results show that t...
Publication date: 2005